SEQUENCE LISTING 



(1) GENERAL INFORMATION 
(i) APPLICANT: Lawrence, Geoffrey 
(ii) TITLE OF THE INVENTION: Novel Compounds 



(iii) NUMBER OF SEQUENCES: 6 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Ratner & Prestia 

(B) STREET: P.O. Box 980 

(C) CITY: Valley Forge 

(D) STATE: PA 

(E) COUNTRY: USA 

(F) ZIP : 19482 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: To be assigned 

(B) FILING DATE: Herewith 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: GB 96 24677.2 

(B) FILING DATE: Filed November 27, 1996 

and 

(A) APPLICATION NUMBER: GB 9709463.5 

(B) FILING DATE: May 9, 1997 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Prestia, Paul F. 

(B) REGISTRATION NUMBER: 23,031 

(C) REFERENCE/ DOCKET NUMBER: GH30170 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 610-407-0700 

(B) TELEFAX: 610-407-0701 

(C) TELEX: 846169 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1200 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

ATGGTGCGCC CCCTGAACCC GCGACCGCTG CCGCCCGTAG TCCTGATGTT GCTGCTGCTG 60 

CTGCCGCCGT CGCCGCTGCC TCTCGCAGCC GGAGACCCCC TTCCCACAGA AAGCCGACTC 120 

ATGAACAGCT GTCTCCAGGC CAGGAGGAAG TGCCAGGCTG ATCCCACCTG CAGTGCTGCC 180 

TACCACCACC TGGATTCCTG CACCTCTAGC ATAAGCACCC CACTGCCCTC AGAGGAGCCT 240 

TCGGTCCCTG CTGACTGCCT GGAGGCAGCA CAGCAACTCA GGAACAGCTC TCTGATAGGC 300 

TGCATGTGCC ACCGGCGCAT G AAGAAC C AG GTTGCCTGCT TGGACATCTA TTGGACCGTT 360 

CACCGTGCCC GCAGCCTTGG TAACTATGAG CTGGATGTCT CCCCCTATGA AGACACAGTG 420 

ACCAGCAAAC CCTGGAAAAT GAATCTCAGC AAACTGAACA TGCTCAAACC AGACTCAGAC 480 

CTCTGCCTCA AGTTTGCCAT GCTGTGTACT CTCAATGACA AGTGTGACCG GCTGCGCAAG 540 

GCCTACGGGG AGGCGTGCTC CGGGCCCCAC TGCCAGCGCC ACGTCTGCCT CAGGCAGCTG 600 

CTCACTTTCT TCGAGAAGGC CGCCGAGCCC CACGCGCAGG GCCTGCTACT GTGCCCATGT 660 

GCCCCCAACG ACCGGGGCTG CGGGGAGCGC CGGCGCAACA CCATCGCCCC CAACTGCGCG 720 

CTGCCGCCTG TGGCCCCCAA CTGCCTGGAG CTGCGGCGCC TCTGCTTCTC CGACCCGCTT 780 

TGCAGATCAC GCCTGGTGGA TTTCCAGACC CACTGCCATC CCATGGACAT CCTAGGAACT 840 

TGTGCAACAG AGCAGTCCAG ATGTCTACGA GCATACCTGG GGCTGATTGG GACTGCCATG 900 

ACCCCCAACT TTGTCAGCAA TGTCAACACC AGTGTTGCCT TAAGCTGCAC CTGCCGAGGC 960 

AGTGGCAACC TGCAGGAGGA GTGTGAAATG CTGGAAGGGT TCTTCTCCCA CAACCCCTGC 1020 

CTCACGGAGG CCATTGCAGC TAAGATGCGT TTTCACAGCC AACTCTTCTC CCAGGACTGG 108 0 

CCACACCCTA CCTTTGCTGT GATGGCACAC CAGAATGAAA ACCCTGCTGT GAGGCCACAG 114 0 

CCCTGGGTGC CCTCTCTTTT CTCCTGCACG CTTCCCTTGA TTCTGCTCCT GAGCCTATGG 1200 



m (2) INFORMATION FOR SEQ ID NO: 2: 

C5 (i) SEQUENCE CHARACTERISTICS: 
%LP (A) LENGTH: 4 00 amino acids 

SJ (B) TYPE: amino acid 

SJ (C) STRANDEDNESS: single 

m ( D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



m 

e (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 
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(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1200 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATGGTGCGCC CCCTGAACCC GCGACCGCTG CCGCCCGTAG TCCTGATGTT GCTGCTGCTG 60 

CTGCCGCCGT CGCCGCTGCC TCTCGCAGCC GGAGACCCCC TTCCCACAGA AAGCCGACTC 120 

ATGAACAGCT GTCTCCAGGC CAGGAGGAAG TGCCAGGCTG ATCCCACCTG CAGTGATGCC 180 

TACCACCACC TGGATTCCTG CACCTCTAGC ATAAGCACCC CACTGCCCTC AGAGGAGCCT 240 

TCGGTCCCTG CTGACTGCCT GGAGGCAGCA CAGCAACTCA GGAACAGCTC TCTGATAGGC 300 

TGCATGTGCC ACCGGCGCAT GAAGAAC CAG GTTGCCTGCT TGGACATCTA TTGGACCGTT 360 

CACCGTGCCC GCAGCCTTGG TAACTATGAG CTGGATGTCT CCCCCTATGA AGACACAGTG 420 

ACCAGCAAAC CCTGGAAAAT GAATCTCAGC AAACT GAACA TGCTCAAACC AGACTCAGAC 4 80 

CTCTGCCTCA AGTTTGCCAT GCTGTGTACT CTCAATGACA AGTGTGACCG GCTGCGCAAG 540 

GCCTACGGGG AGGCGTGCTC CGGGCCCCAC TGCCAGCGCC ACGTCTGCCT CAGGCAGCTG 600 

CTCACTTTCT TCGAGAAGGC CGCCGAGCCC CACGCGCAGG GCCTGCTACT GTGCCCATGT 660 

GCCCCCAACG ACCGGGGCTG CGGGGAGCGC CGGCGCAACA CCATCGCCCC CAACTGCGCG 720 

CTGCCGCCTG TGGCCCCCAA CTGCCTGGAG CTGCGGCGCC TCTGCTTCTC CGACCCGCTT 780 

TGCAGATCAC GCCTGGTGGA TTTCCAGACC CACTGCCATC CCATGGACAT CCTAGGAACT 84 0 

TGTGCAACAG AGCAGTCCAG ATGTCTACGA GCATACCTGG GGCTGATTGG GACTGCCATG 900 

ACCCCCAACT TTGTCAGCAA TGTCAACACC AGTGTTGCCT TAAGCTGCAC CTGCCGAGGC 960 

AGTGGCAACC TGCAGGAGGA GTGTGAAATG CTGGAAGGGT TCTTCTCCCA CAACCCCTGC 1020 

CTCACGGAGG CCATTGCAGC TAAGATGCGT TTTCACAGCC AACTCTTCTC CCAGGACTGG 1080 

CCACACCCTA CCTTTGCTGT GATGGCACAC CAGAATGAAA ACCCTGCTGT GAGGCCACAG 1140 

CCCTGGGTGC CCTCTCTTTT CTCCTGCACG CTTCCCTTGA TTCTGCTCCT GAGCCTATGG 1200 



(2) INFORMATION FOR SEQ ID NO: 4: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 00 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
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(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 519 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GAGCGCCGGC GCAACACCAT CGCCCCCAAC TGCGCGCTGC CGCCTGTGGC CCCCAACTGC 60 

CTGGAGCTGC GGCGCCTCTG CTTCTCCGAC CCGCTTTGCA GATCACGCCT GGTGGATTTC 120 

CAGACCCACT GCCATCCCAT GGACATCCTA GGAACTTGTG CAACAGAGCA GTCCAGATGT 180 

CTACGAGCAT ACCTGGGGCT GATTGGGACT GCCATGACCC CCAACTTTGT CAGCAATGTC 240 

AACACCAGTG TTGCCTTAAG CTGCACCTGC CGAGGCAGTG GCAACCTGCA GGAGGAGTGT 300 

GAAATGCTGG AAGGGTTCTT CTCCCACAAC CCCTGCCTCA CGGAGGCCAT TGCAGCTAAG 360 

ATGCGTTTTC ACAGCCAACT CTTCTCCCAG GACTGGCCAC ACCCTACCTT TGCTGTGATG 420 

GCACACCAGA ATGAAAACCC TGCTGTGAGG CCACAGCCCT GGGTGCCCTC TCTTTTCTCC 480 

TGCACGCTTC CCTTGATTCT GCTCCTGAGC CTATGGTAG 519 

y (2) INFORMATION FOR SEQ ID NO: 6: 

■2 (i) SEQUENCE CHARACTERISTICS: 

! -Jj (A) LENGTH: 172 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



M= (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION 
(i) APPLICANT: Lawrence, Geoffrey 
(ii) TITLE OF THE INVENTION: Novel Compounds 



(iii) NUMBER OF SEQUENCES: 6 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Ratner & Prestia 

(B) STREET: P.O. Box 980 

(C) CITY: Valley Forge 

(D) STATE: PA 

(E) COUNTRY: USA 

(F) ZIP: 19482 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2. 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: To be assigned 

(B) FILING DATE: Herewith 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: GB 96 24677.2 

(B) FILING DATE: Filed November 27, 1996 

and 

(A) APPLICATION NUMBER: GB 9709463.5 

(B) FILING DATE: May 9, 1997 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Prestia, Paul F. 

(B) REGISTRATION NUMBER: 23,031 

(C) REFERENCE/ DOCKET NUMBER: GH3017 0 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 610-407-0700 

(B) TELEFAX: 610-407-0701 

(C) TELEX: 846169 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1200 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

ATGGTGCGCC CCCTGAACCC GCGACCGCTG CCGCCCGTAG TCCTGATGTT GCTGCTGCTG 60 

CTGCCGCCGT CGCCGCTGCC TCTCGCAGCC GGAGACCCCC TTCCCACAGA AAGCCGACTC 120 

AT GAACAGCT GTCTCCAGGC CAG GAGGAAG TGCCAGGCTG ATCCCACCTG CAGTGCTGCC 180 

TACCACCACC TGGATTCCTG CACCTCTAGC ATAAGCACCC CACTGCCCTC AGAGGAGCCT 240 

TCGGTCCCTG CTGACTGCCT GGAGGCAGCA CAGCAACTCA GGAACAGCTC T CT GAT AG G C 300 

TGCATGTGCC ACCGGCGCAT GAAGAAC CAG GTTGCCTGCT TGGACATCTA TTGGACCGTT 360 

CACCGTGCCC GCAGCCTTGG TAACT AT GAG CTGGATGTCT CCCCCTATGA AGACACAGTG 420 

ACCAGCAAAC CCTGGAAAAT GAATCTCAGC AAACT GAACA TGCTCAAACC AGACT CAGAC 4 80 

CTCTGCCTCA AGTTTGCCAT GCTGTGTACT CTCAATGACA AGTGTGACCG GCTGCGCAAG 54 0 

GCCTACGGGG AGGCGTGCTC CGGGCCCCAC TGCCAGCGCC ACGTCTGCCT CAGGCAGCTG 600 

CTCACTTTCT TCGAGAAGGC CGCCGAGCCC CACGCGCAGG GCCTGCTACT GTGCCCATGT 660 

GCCCCCAACG ACCGGGGCTG CGGGGAGCGC CGGCGCAACA CCATCGCCCC CAACTGCGCG 720 

CTGCCGCCTG TGGCCCCCAA CTGCCTGGAG CTGCGGCGCC TCTGCTTCTC CGACCCGCTT 7 80 

TGCAGATCAC GCCTGGTGGA TTTC CAGAC C CACTGCCATC C CAT G G AC AT CCTAGGAACT 840 

TGTGCAACAG AG CAGT C CAG ATGTCTACGA GCATACCTGG GGCTGATTGG GACT GC CAT G 900 

ACCCCCAACT TTGTCAGCAA TGTCAACACC AGTGTTGCCT TAAGCT GCAC CTGCCGAGGC 960 

AGTGGCAACC TGCAGGAGGA GTGTGAAATG CTGGAAGGGT TCTTCTCCCA CAACCCCTGC 1020 

CTCACGGAGG CCATTGCAGC TAAGATGCGT TTTCACAGCC AACTCTTCTC CCAGGACTGG 108 0 

CCACACCCTA CCTTTGCTGT GAT GGC AC AC CAGAAT GAAA ACCCTGCTGT GAGGCCACAG 1140 

CCCTGGGTGC CCTCTCTTTT CTCCTGCACG CTTCCCTTGA TTCTGCTCCT GAGCCTATGG 1200 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 



Met 


Val 


Arg 


Pro 


Leu 


Asn 


Pro 


Arg 


Pro 


Leu 


Pro 


Pro 


Val 


Val 


Leu 


Met 


1 






5 










10 










15 




Leu 


Leu 


Leu 


Leu 
20 


Leu 


Pro 


Pro 


Ser 


Pro 
25 


Leu 


Pro 


Leu 


Ala 


Ala 
30 


Gly 


Asp 


Pro 


Leu 


Pro 
35 


Thr 


Glu 


Ser 


Arg 


Leu 
40 


Met 


Asn 


Ser 


Cys 


Leu 
45 


Gin 


Ala 


Arg 


Arg 


Lys 


Cys 


Gin 


Ala 


Asp 


Pro 


Thr 


Cys 


Ser 


Ala 


Ala 


Tyr 


His 


His 


Leu 


50 










55 










60 










Asp 


Ser 


Cys 


Thr 


Ser 


Ser 


He 


Ser 


Thr 


Pro 


Leu 


Pro 


Ser 


Glu 


Glu 


Pro 


65 








70 










75 










80 


Ser 


Val 


Pro 


Ala 


Asp 
85 


Cys 


Leu 


Glu 


Ala 


Ala 
90 


Gin 


Gin 


Leu 


Arg 


Asn 
95 


Ser 


Ser 


Leu 


lie 


Gly 
100 


Cys 


Met 


Cys 


His 


Arg 
105 


Arg 


Met 


Lys 


Asn 


Gin 
110 


Val 


Ala 


Cys 


Leu 


Asp 
115 


He 


Tyr 


Trp 


Thr 


Val 
120 


His 


Arg 


Ala 


Arg 


Ser 
125 


Leu 


Gly 


Asn 


Tyr 


Glu 


Leu 


Asp 


Val 


Ser 


Pro 


Tyr 


Glu 


Asp 


Thr 


Val 


Thr 


Ser 


Lys 


Pro 


130 










135 










140 










Trp 


Lys 


Met 


Asn 


Leu 


Ser 


Lys 


Leu 


Asn 


Met 


Leu 


Lys 


Pro 


Asp 


Ser 


Asp 


145 








150 










155 










160 


Leu 


Cys 


Leu 


Lys 


Phe 


Ala 


Met 


Leu 


Cys 


Thr 


Leu 


Asn 


Asp 


Lys 


Cys 


Asp 






165 










170 










175 




Arg 


Leu 


Arg 


Lys 
180 


Ala 


Tyr 


Gly 


Glu 


Ala 
185 


Cys 


Ser 


Gly 


Pro 


His 
190 


Cys 


Gin 


Arg 


His 


Val 


Cys 


Leu 


Arg 


Gin 


Leu 


Leu 


Thr 


Phe 


Phe 


Glu 


Lys 


Ala 


Ala 







195 










200 










205 








Glu 


Pro 


His 


Ala 


Gin 


Gly 


Leu 


Leu 


Leu 


Cys 


Pro 


Cys 


Ala 


Pro 


Asn 


Asp 




210 










215 










220 








Arg 


Gly 


Cys 


Gly 


Glu 


Arg 


Arg 


Arg 


Asn 


Thr 


He 


Ala 


Pro 


Asn 


Cys 


Ala 


225 










230 










235 








240 


Leu 


Pro 


Pro 


Val 


Ala 


Pro 


Asn 


Cys 


Leu 


Glu 


Leu 


Arg 


Arg 


Leu 


Cys 


Phe 










245 










250 










255 




Ser 


Asp 


Pro 


Leu 


Cys 


Arg 


Ser 


Arg 


Leu 


Val 


Asp 


Phe 


Gin 


Thr 


His 


Cys 








260 










265 










270 




His 


Pro 


Met 


Asp 


He 


Leu 


Gly 


Thr 


Cys 


Ala 


Thr 


Glu 


Gin 


Ser 


Arg 


Cys 






275 










280 










285 




Leu 


Arg 


Ala 


Tyr 


Leu 


Gly 


Leu 


He 


Gly 


Thr 


Ala 


Met 


Thr 


Pro 


Asn 


Phe 




290 










295 










300 










Val 


Ser 


Asn 


Val 


Asn 


Thr 


Ser 


Val 


Ala 


Leu 


Ser 


Cys 


Thr 


Cys 


Arg 


Gly 


305 










310 










315 






320 


Ser 


Gly Asn 


Leu 


Gin 


Glu 


Glu 


Cys 


Glu 


Met 


Leu 


Glu 


Gly 


Phe 


Phe 


Ser 










325 










330 










335 




His 


Asn 


Pro 


Cys 


Leu 


Thr 


Glu 


Ala 


He 


Ala 


Ala 


Lys 


Met 


Arg 


Phe 


His 








340 










345 










350 






Ser 


Gin 


Leu 


Phe 


Ser 


Gin 


Asp 


Trp 


Pro 


His 


Pro 


Thr 


Phe 


Ala 


Val 


Met 






355 










360 










365 








Ala 


His 


Gin 


Asn 


Glu 


Asn 


Pro 


Ala 


Val 


Arg 


Pro 


Gin 


Pro 


Trp 


Val 


Pro 




370 










375 










380 








Ser 


Leu 


Phe 


Ser 


Cys 


Thr 


Leu 


Pro 


Leu 


He 


Leu 


Leu 


Leu 


Ser 


Leu 


Trp 


385 










390 










395 










400 



^ (2) INFORMATION FOR SEQ ID NO : 3 : 

w 

CP (i) SEQUENCE CHARACTERISTICS: 

l"y (A) LENGTH: 1200 base pairs 

« (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

J 7 (D) TOPOLOGY: linear 



(iij MOLECULE TYPE: cDNA 



m 
in 

sQ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



ATGGTGCGCC CCCTGAACCC GCGACCGCTG CCGCCCGTAG TCCTGATGTT GCTGCTGCTG 60 

CTGCCGCCGT CGCCGCTGCC TCTCGCAGCC GGAGACCCCC TTCCCACAGA AAGCCGACTC 120 

AT GAAC AGCT GTCTCCAGGC CAGGAGGAAG TGCCAGGCTG ATCCCACCTG CAGTGATGCC 180 

TACCACCACC TGGATTCCTG CACCTCTAGC ATAAGCACCC CACTGCCCTC AGAGGAGCCT 240 

TCGGTCCCTG CTGACTGCCT GGAGGCAGCA CAGCAACTCA GGAACAGCTC TCTGATAGGC 3 00 

TGCATGTGCC ACCGGCGCAT GAAGAAC C AG GTTGCCTGCT TGGACATCTA TTGGACCGTT 360 

CACCGTGCCC GCAGCCTTGG TAACTAT GAG CTGGATGTCT CCCCCTATGA AGACACAGTG 42 0 

ACCAGCAAAC CCTGGAAAAT GAATCTCAGC AAACT GAAC A TGCTCAAACC AGACTCAGAC 4 80 

CTCTGCCTCA AGTTTGCCAT GCTGTGTACT CTCAATGACA AGTGTGACCG GCTGCGCAAG 54 0 

GCCTACGGGG AGGCGTGCTC CGGGCCCCAC TGCCAGCGCC ACGTCTGCCT CAGGCAGCTG 600 

CTCACTTTCT TCGAGAAGGC CGCCGAGCCC CACGCGCAGG GCCTGCTACT GTGCCCATGT 660 

GCCCCCAACG ACCGGGGCTG CGGGGAGCGC CGGCGCAACA CCATCGCCCC CAACTGCGCG 72 0 

CTGCCGCCTG TGGCCCCCAA CTGCCTGGAG CTGCGGCGCC TCTGCTTCTC CGACCCGCTT 7 80 

T G C AG AT C AC GCCTGGTGGA TTTCCAGACC CACTGCCATC C CAT GGACAT CCTAGGAACT 840 

TGTGCAACAG AGCAGTCCAG ATGTCTACGA GCATACCTGG GGCTGATTGG GACTGCCATG 900 

ACCCCCAACT TTGTCAGCAA TGTCAACACC AGTGTTGCCT TAAGCTGCAC CTGCCGAGGC 960 

AGTGGCAACC TGCAGGAGGA GTGTGAAATG CTGGAAGGGT TCTTCTCCCA CAACCCCTGC 1020 

CTCACGGAGG CCATTGCAGC TAAGATGCGT TTTCACAGCC AACTCTTCTC CCAGGACTGG 108 0 

CCACACCCTA CCTTTGCTGT GAT GGC AC AC C AG AAT G AAA ACCCTGCTGT GAGGCCACAG 1140 

CCCTGGGTGC CCTCTCTTTT CTCCTGCACG CTTCCCTTGA TTCTGCTCCT GAGCCTATGG 1200 



(2) INFORMATION FOR SEQ ID NO : 4 : 



# 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 00 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Val Arg Pro Leu Asn Pro Arg Pro Leu Pro Pro Val Val Leu Met 

15 10 15 

Leu Leu Leu Leu Leu Pro Pro Ser Pro Leu Pro Leu Ala Ala Gly Asp 

20 25 30 

Pro Leu Pro Thr Glu Ser Arg Leu Met Asn Ser Cys Leu Gin Ala Arg 

35 40 45 

Arg Lys Cys Gin Ala Asp Pro Thr Cys Ser Asp Ala Tyr His His Leu 

50 55 60 

Asp Ser Cys Thr Ser Ser He Ser Thr Pro Leu Pro Ser Glu Glu Pro 

65 70 75 80 

Q Ser Val Pro Ala Asp Cys Leu Glu Ala Ala Gin Gin Leu Arg Asn Ser 

Eg 85 90 95 

yj Ser Leu Ile Gly Cys Met Cys His Ar< 3 Ar< ? Met L V S Asn Gin Val Ala 

rt 100 105 no 

^ Cys Leu Asp Ile Tyr Trp Thr Val His Arg Ala Arg Ser Leu Gly Asn 

>i 115 120 125 

fTJ Tyr Glu Leu Asp Val Ser Pro Tyr Glu Asp Thr Val Thr Ser Lys Pro 

01 130 135 140 

fy Trp Lys Met Asn Leu Ser Lys Leu Asn Met Leu Lys Pro Asp Ser Asp 

L 145 150 155 160 

l. Leu Cys Leu Lys Phe Ala Met Leu Cys Thr Leu Asn Asp Lys Cys Asp 

L 165 170 175 

^ Arg Leu Arg Lys Ala Tyr Gly Glu Ala Cys Ser Gly Pro His Cys Gin 

fW 180 185 190 

U1 Ar< ? His Va l Cys Leu Arg Gin Leu Leu Thr Phe Phe Glu Lys Ala Ala 

195 200 205 

Glu Pro His Ala Gin Gly Leu Leu Leu Cys Pro Cys Ala Pro Asn Asp 

210 215 220 

Arg Gly Cys Gly Glu Arg Arg Arg Asn Thr Ile Ala Pro Asn Cys Ala 
225 2 30 235 240 

Leu Pro Pro Val Ala Pro Asn Cys Leu Glu Leu Arg Arg Leu Cys Phe 

245 250 255 

Ser Asp Pro Leu Cys Arg Ser Arg Leu Val Asp Phe Gin Thr His Cys 

260 265 270 

His Pro Met Asp Ile Leu Gly Thr Cys Ala Thr Glu Gin Ser Ara Cys 

275 280 285 

Leu Arg Ala Tyr Leu Gly Leu Ile Gly Thr Ala Met Thr Pro Asn Phe 

290 295 300 

Val Ser Asn Val Asn Thr Ser Val Ala Leu Ser Cys Thr Cys Arq Glv 
3 °5 310 315 320 

Ser Gly Asn Leu Gin Glu Glu Cys Glu Met Leu Glu Gly Phe Phe Ser 

325 330 335 

His Asn Pro Cys Leu Thr Glu Ala Ile Ala Ala Lys Met Arg Phe His 

340 345 350 

Ser Gin Leu Phe Ser Gin Asp Trp Pro His Pro Thr Phe Ala Val Met 

355 360 365 

Ala His Gin Asn Glu Asn Pro Ala Val Arg Pro Gin Pro Trp Val Pro 

370 375 380 

Ser Leu Phe Ser Cys Thr Leu Pro Leu Ile Leu Leu Leu Ser Leu Trp 



SJ 




385 390 395 400 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 519 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GAGCGCCGGC G C AACAC CAT CGCCCCCAAC TGCGCGCTGC CGCCTGTGGC CCCCAACTGC 60 

CTGGAGCTGC GGCGCCTCTG CTTCTCCGAC CCGCTTTGCA GATCACGCCT GGTGGATTTC 120 

CAGACCCACT GCCATCCCAT GGACATCCTA GGAACTTGTG CAACAGAGCA GTCCAGATGT 18 0 

CTAC GAG CAT ACCTGGGGCT GATTGGGACT GCCATGACCC CCAACTTTGT CAGCAATGTC 24 0 

AACAC CAGT G TTGCCTTAAG CTGCACCTGC CGAGGCAGTG GCAACCTGCA GGAGGAGTGT 300 

GAAATGCTGG AAGGGTTCTT CTCCCACAAC CCCTGCCTCA CGGAGGCCAT TGCAGCTAAG 360 

ATGCGTTTTC ACAGCCAACT CTTCTCCCAG GACTGGCCAC ACCCTACCTT TGCTGTGATG 42 0 

m GCACACCAGA ATGAAAACCC TGCTGTGAGG CCACAGCCCT GGGTGCCCTC TCTTTTCTCC 48 0 

r? TGCACGCTTC CCTTGATTCT GCTCCTGAGC CTATGGTAG 519 

m 

**Q (2) INFORMATION FOR SEQ ID NO: 6: 

SI- 
SI (i) SEQUENCE CHARACTERISTICS: 
gj (A) LENGTH: 172 amino acids 
ffi (B) TYPE: amino acid 

(C) STRANDEDNESS: single 
iy (D) TOPOLOGY: linear 

3 

H 1 (ii) MOLECULE TYPE: peptide 

ftj (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

iri 



Glu 


Arg 


Arg 


Arg 


Asn 


Thr 


He 


Ala 


Pro 


Asn 


Cys 


Ala 


Leu 


Pro 


Pro 


Val 


1 








5 










10 










15 




Ala 


Pro 


Asn 


Cys 
20 


Leu 


Glu 


Leu 


Arg 


Arg 
25 


Leu 


Cys 


Phe 


Ser 


Asp 
30 


Pro 


Leu 


c y s 


Arg 


Ser 
35 


Arg 


Leu 


Val 


Asp 


Phe 
40 


Gin 


Thr 


His 


Cys 


His 
45 


Pro 


Met 


Asp 


He 


Leu 
50 


Gly 


Thr 


Cys 


Ala 


Thr 
55 


Glu 


Gin 


Ser 


Arg 


Cys 
60 


Leu 


Arg 


Ala 


Tyr 


Leu 


Gly 


Leu 


He 


Gly 


Thr 


Ala 


Met 


Thr 


Pro 


Asn 


Phe 


Val 


Ser 


Asn 


Val 


65 










70 










75 










80 


Asn 


Thr 


Ser 


Val 


Ala 


Leu 


Ser 


Cys 


Thr 


Cys 


Arg 


Gly 


Ser 


Gly Asn 


Leu 










85 










90 










95 




Gin 


Glu 


Glu 


Cys 


Glu 


Met 


Leu 


Glu 


Gly 


Phe 


Phe 


Ser 


His 


Asn 


Pro 


Cys 








100 










105 










110 




Leu 


Thr 


Glu 
115 


Ala 


He 


Ala 


Ala 


Lys 
120 


Met 


Arg 


Phe 


His 


Ser 
125 


Gin 


Leu 


Phe 


Ser 


Gin 
130 


Asp 


Trp 


Pro 


His 


Pro 
135 


Thr 


Phe 


Ala 


Val 


Met 
140 


Ala 


His 


Gin 


Asn 


Glu 


Asn 


Pro 


Ala 


Val 


Arg 


Pro 


Gin 


Pro 


Trp 


Val 


Pro 


Ser 


Leu 


Phe 


Ser 


145 










150 










155 










160 


Cys 


Thr 


Leu 


Pro 


Leu 
165 


He 


Leu 


Leu 


Leu 


Ser 
170 


Leu 


Trp 











w 

m 

m 



m 

03 



GH30170 



Figure I - SEQ ID NO:l 



- m 



1 


ATGGTGCGCC 


CCCTGAACCC 


GCGACCGCTG 


CCGCCCGTAG 


TCCTGATGTT 


51 


GCTGCTGCTG 


CTGCCGCCGT 


CGCCGCTGCC 


TCTCGCAGCC 


GGAGACCCCC 


101 


TTCCCACAGA 


AAGCCGACTC 


ATGAACAGCT 


GTCTCCAGGC 


CAGGAGGAAG 


- 151 


TGCCAGGCTG 


ATCCCACCTG 


CAGTGCTGCC 


TACCACCACC 


TGGATTCCTG 


201 


CACCTCTAGC 


ATAAGCACCC 


CACTGCCCTC 


AGAGGAGCCT 


TCGGTCCCTG 


251 


CTGACTGCCT 


GGAGG CAGCA 


CAGCAACTCA 


GGAACAGCTC 


TCTGATAGGC 


301 


TGCATGTGCC 


ACCGGCGCAT 


GAAGAACCAG 


GTTGCCTGCT 


TGGAcATCTA 


351 


TTGGAC CGTT 


CACCGTGCCC 


GCAGCCTTGG 


TAACTATGAG 


CTGGATGTCT 


401 


CCCCCTATGA 


AGACACAGTG 


ACCAGCAAAC 


CCTGGAAAAT 


GAATCTCAGC 


451 


AAACTGAACA 


TGCTCAAACC 


AGACTCAGAC 


CTCTGCCTCA 


AGTTTGCCAT 


501 


GCTGTGTACT 


CTCAATGACA 


AGTGTGACCG 


GCTGCGCAAG 


GCCTACGGGG 


551 


AGGCGTGCTC 


CGGGCCCCAC 


TGCCAGCGCC 


ACGTCTGCCT 


CAGGC AG CTG 


601 


CTCACTTTCT 


TCGAGAAGGC 


CGCCGAGCCC 


CACGCGCAGG 


GCCTGCTACT 


651 


GTGCCCATGT 


GCCCCCAACG 


AC CGGGGCTG 


CGGGGAGCGC 


CGGCGCAACA 


701 


CCATCGCCCC 


CAACTGCGCG 


CTGCCGCCTG 


TGGCCCCCAA 


CTGCCTGGAG 


751 


CTGCGGCGCC 


TCTGCTTCTC 


CGACCCGCTT 


TGCAGATCAC 


GCCTGGTGGA 


801 


TTTCCAGACC 


CACTGCCATC 


CCATGGACAT 


CCTAGGAACT 


TGTGCAACAG 


851 


AGCAGTCCAG 


ATGTCTACGA 


GCATACCTGG 


GGCTGATTGG 


GACTGCCATG 


901 


ACCCCCAACT 


TTGTCAGCAA 


TGTCAACACC 


AGTGTTGCCT 


TAAGCTGCAC 


951 


CTGCCGAGGC 


AGTGGCAACC 


TGCAGGAGGA 


GTGTGAAATG 


CTGGAAGGGT 


1001 


TCTTCTCCCA 


CAACCCCTGC 


CTCACGGAGG 


CCATTGCAGC 


TAAGATGCGT 


1051 


TTTCACAGCC 


AACTCTTCTC 


CCAGGACTGG 


CCACACCCTA 


CCTTTGCTGT 


1101 
1151 


GATGGCACAC 
CCTCTCTTTT 


CAGAATGAAA 
CTCCTGCACG 


ACCCTGCTGT 
CTTCCCTTGA 


GAGGCCACAG 
TTCTGCTCCT 


CCCTGGGTGC 
GAGCCTATGG 


re 2 - SEQ ID NO:2 










l 


MVRPLNPRPL 


P P WLMLLLLi 


LPPSPLPLAA 


GDPLPTESRL 


MNS CLQARRK 


51 


CQADPTCSAA 


YHHLDSCTSS 


ISTPLPSEEP 


SVPADCLEAA 


QQLRNSSLIG 


101 


CMCHRRMKNQ 


VACLDIYWTV 


HRARSLGNYE 


LDVSPYEDTV 


TSKPWKMNLS 


151 


KLNMLKPDSD 


LCLKFAMLCT 


LNDKCDRLRK 


AYGEACSGPH 


CQRHVCLRQL 


201 


LTFFEKAAEP 


HAQGLLLCPC 


APNDRGCGER 


RRNTIAPNCA 


LPPVAPNCLE 


251 


LRRLCFSDPL 


CRSRLVDFQT 


HCHPMDILGT 


CATEQSRCLR 


AYLGLIGTAM 


301 
351 


TPNFVSNVNT 
FHSQLFSQDW 


SVALSCTCRG 
PHPTFAVMAH 


SGNLQEECEM 
QNENPAVRPQ 


LEGFFSHNPC 
PWVPSLFSCT 


LTEAIAAKMR 
LPLILLLSLW 



1 



GH30170 



Figure 3 - SEQ ED NO:3 

1 ATGGTGCGCC CCCTGAACCC GCGAC CGCTG CCGCCCGTAG TCCTGATGTT 

51 GCTGCTGCTG CTGCCGCCGT CGCCGCTGCC TCTCGCAGCC GGAGACCCCC 

101 TTCC CACAGA AAGCCGACTC ATGAACAGCT GTCTCCAGGC CAGGAGGAAG 

151 TGCCAGGCTG ATCCCACCTG CAGTGATGCC TACCACCACC TGGATTCCTG 

2 01 CACCTCTAGC ATAAGCACCC CACTGCCCTC AGAGGAGCCT TCGGTCCCTG 

2 51 CTGACTGCCT GGAGGCAGCA CAGCAACTCA GGAACAGCTC TCTGATAGGC 

3 01 TGCATGTGCC ACCGGCGCAT GAAGAACCAG GTTGCCTGCT TGGAcATCTA 

3 51 TTGGACCGTT CACCGTGCCC GCAGCCTTGG TAACTATGAG CTGGATGTCT 

4 01 CCCCCTATGA AGACACAGTG ACCAGCAAAC CCTGGAAAAT GAATCTCAGC 
451 AAACTGAACA TGCTCAAACC AGACTCAGAC CTCTGCCTCA AGTTTGCCAT 
501 GCTGTGTACT CTCAATGACA AGTGTGACCG GCTGCGCAAG GCCTACGGGG 
551 AGGCGTGCTC CGGGCCCCAC TGCCAGCGCC ACGTCTGCCT CAGGCAGCTG 
601 CTCACTTTCT TCGAGAAGGC CGCCGAGCCC CACGCGCAGG GCCTGCTACT 
651 GTGCCCATGT GCCCCCAACG ACCGGGGCTG CGGGGAGCGC CGGCGCAACA 
701 CCATCGCCCC CAACTGCGCG CTGCCGCCTG TGGCCCCCAA CTGC CTGGAG 
751 CTGCGGCGCC TCTGCTTCTC CGACCCGCTT TGCAGATCAC GCCTGGTGGA 
801 TTTCCAGACC CACTGCCATC CCATGGACAT CCTAGGAACT TGTGCAACAG 
851 AGCAGTCCAG ATGTCT AC GA GC AT AC CTGG GGCTGATTGG GACTGCCATG 
901 ACCCCCAACT TTGT CAGCAA TGTCAACACC AGTGTTGCCT TAAGCTGCAC 
951 CTGCCGAGGC AGTGGCAACC TGCAGGAGGA GTGTGAAATG CTGGAAGGGT 

1001 TCTTCTCCCA CAACCCCTGC CTCACGGAGG CCATTGCAGC TAAGATGCGT 

1051 TTTCACAGCC AACTCTTCTC CCAGGACTGG CCACACCCTA CCTTTGCTGT 

1101 GATGGCACAC CAGAATGAAA ACCCTGCTGT GAGGCCACAG CCCTGGGTGC 

1151 CCTCTCTTTT CTCCTGCACG CTTCCCTTGA TTCTGCTCCT GAGC CTATGG 

Figure 4 - SEQ ID NO:4 

1 MVRPLNPRPL PPWLMLLLL LPPSPLPLAA GDPLPTESRL MNSCLQARRK 

51 CQADPTCSDA YHHLDSCTSS ISTPLPSEEP SVPADCLEAA QQLRNSSLIG 

101 CMCHRRMKNQ VACLDIYWTV HRARS LGNYE LDVSPYEDTV TSKPWKMNLS 

151 KLNMLKPDSD LCLKFAMLCT LNDKCDRLRK AYGEACSGPH CQRHVCLRQL 

201 LTFFEKAAEP HAQGLLLCPC APNDRGCGER RRNTIAPNCA LPPVAPNCLE 

251 LRRLCFSDPL CRSRLVDFQT HCHPMDILGT CATEQSRCLR AYLGLIGTAM 

3 01 TPNFVSNVNT SVALSCTCRG SGNLQEECEM LEGFFSHNPC LTEAIAAKMR 

351 FHSQLFSQDW PHPTFAVMAH QNENPAVRPQ PWVPSLFSCT LPLILLLSLW 



GH30170 




Figure 5 - SEQ ID NO:5 (Partial coding cDNA sequence for human GDNF a3 



receptor) 










X 


GAGCGCCGGC 


GCAACACCAT 


CGCCCCCAAC 


TGCGCGCTGC 


CGCCTGTGGC 


*J X 


CCCCAACTGC 


CTGGAGCTGC 


GGCGCCTCTG 


CTTCTCCGAC 


CCGCTTTGCA 


X w X 


GATCACGCCT 


GGTGGATTTC 


CAGACCCACT 


GCCATCCCAT 


GGACATCCTA 


1 SI 


GGAACTTGTG 

\jvjnnv x x vj x vj 


CAACAGAGCA 


GTCCAGATGT 


CTACGAGCAT 


ACCTGGGGCT 


4i U 1 


GATTGGGACT 

VJil X X VJVJVJ*i\/ X 


GCCATGACCC 


CCAACTTTGT 


CAGCAATGTC 


AACACCAGTG 


251 


TTGCCTTAAG 


CTGCACCTGC 


CGAGGCAGTG 


GCAACCTGCA 


GGAGGAGTGT 


301 


GAAATGCTGG 


AAGGGTTCTT 


CTCCCACAAC 


CCCTGCCTCA 




351 


TGCAGCTAAG 


ATGCGTTTTC 


ACAGCCAACT 


CTTCTCCCAG 


GACTGGCCAC 


401 


ACCCTACCTT 


TGCTGTGATG 


GCACACCAGA 


ATGAAAACCC 


TGCTGTGAGG 


451 


CCACAGCCCT 


GGGTGCCCTC 


TCTTTTCTCC 


TGCACGCTTC 


CCTTGATTCT 


501 


GCTCCTGAGC 


CTATGGTAG 









Figure 6 - SEQ ID NO:6 (amino acid sequence for the partial GDNF ct3 receptor 
sequence) 

1 ERRRNTIAPN CALPPVAPNC LELRRLCFSD PLCRSRLVDF QTHCHPMDIL 
51 GTCATEQSRC LRAYLGLIGT AMTPNFVSNV NTSVALSCTC RGSGNLQEEC 
101 EMLEGFFSHN PCLTEAIAAK MRFHSQLFSQ DWPHPTFAVM AHQNENPAVR 
151 POP WVPSLFS CTLPLILLLS LW 

The sequence underlined corresponds to the predicted hydrophobic C-terminus 
characteristic of GPI anchored cell-surface receptors 



GH30170 



Figure I - SEQ ID NO:l 



1 


ATGGTGCGCC 


51 


GCTGCTGCTG 


101 


TTCCCACAGA 


151 


TGCCAGGCTG 


201 


CACCTCTAGC 


251 


CTGACTGCCT 


301 


TGCATGTGCC 


351 


TTGG AC CGTT 


401 


CCCCCTATGA 


451 


AAACTGAACA 


501 


GCTGTGTACT 


551 


AGGCGTGCTC 


601 


CTCACTTTCT 


651 


GTGCCCATGT 


701 


CCATCGCCCC 


751 


CTGCGGCGCC 


801 


TTTCCAGACC 


851 


AGCAGTCCAG 


901 


ACCCCCAACT 


951 


CTGCCGAGGC 


1001 


TCTTCTCCCA 


1051 


TTTCACAGCC 


1101 


GATGGCACAC 


1151 


CCTCTCTTTT 



Figure 2 - SEQ ED NO:2 



1 


MVRPLNPRPL 


51 


CQADPTCSAA 


101 


CMCHRRMKNQ 


151 


KLNMLKPDSD 


201 


LTFFEKAAEP 


251 


LRRLCFSDPL 


301 


TPNFVSNVNT 


351 


FHSQLFSQDW 



CCCTGAACCC GCGACCGCTG 

CTGCCGCCGT CGCCGCTGCC 

AAGCCGACTC ATGAACAGCT 

ATCCCACCTG CAGTGCTGCC 

ATAAGCACCC CACTGCCCTC 

GGAGGCAGCA CAGCAACTCA 

ACCGGCGCAT GAAGAACCAG 

CACCGTGCCC GCAGCCTTGG 

AGACACAGTG ACCAGCAAAC 

TGCTCAAACC AGACTCAGAC 

CTCAATGACA AGTGTGACCG 

CGGGCCCCAC TGCCAGCGCC 

TCGAGAAGGC CGCCGAGCCC 

GCCCCCAACG ACCGGGGCTG 

CAACTGCGCG CTGCCGCCTG 

TCTGCTTCTC CGACCCGCTT 

CACTGCCATC C C ATGG AC AT 

ATGTCTACGA GCATACCTGG 

TTGTCAGCAA TGTCAACACC 

AGTGGCAACC TGCAGGAGGA 

CAACCCCTGC CTCACGGAGG 

AACTCTTCTC CCAGGACTGG 

CAGAATGAAA ACCCTGCTGT 
CTCCTGCACG CTTCCCTTGA 



PPWLMLLLL LPPSPIiPLAA 

YHHLDSCTSS ISTPLPSEEP 

VACLDIYWTV HRARS LGNYE 

LCLKFAMLCT LNDKCDRLRK 

HAQGLLLCPC APNDRGCGER 

CRS RLVDFQT HCHPMDILGT 

SVALSCTCRG SGNLQEECEM 
PHPTFAVMAH QNENPAVRPQ 



CCGCCCGTAG TCCTGATGTT 

TCTCGCAGCC GGAGACCCCC 

GTCTCCAGGC CAGGAGGAAG 

TACCACCACC TGGATTCCTG 

AGAGGAGCCT TCGGTCCCTG 

GGAACAGCTC TCTGATAGGC 

GTTGCCTGCT TGGAcATCTA 

TAACTATGAG CTGGATGTCT 

CCTGGAAAAT GAATCTCAGC 

CTCTGCCTCA AGTTTGC CAT 

GCTGCGCAAG GCCTACGGGG 

ACGTCTGCCT CAGGCAGCTG 

CACGCGCAGG GCCTGCTACT 

CGGGGAGCGC CGGCGCAACA 

TGGCCCCCAA CTGCCTGGAG 

TGCAGATCAC GCCTGGTGGA 

CCTAGGAACT TGTGCAACAG 

GGCTGATTGG GACTGCCATG 

AGTGTTGCCT TAAGCTGCAC 

GTGTGAAATG CTGGAAGGGT 

CCATTGCAGC TAAGATGCGT 

CCACACCCTA CCTTTGCTGT 

GAGGCCACAG CCCTGGGTGC 
TTCTGCTCCT GAGCCTATGG 



GDPLPTESRL MNSCLQARRK 

SVPADCLEAA QQLRNSSLIG 

LDVSPYEDTV TSKPWKMNLS 

AYGEACSGPH CQRHVCLRQL 

RRNTIAPNCA LPPVAPNCLE 

CATEQSRCLR AYLGLIGTAM 

LEGFFSHNPC LTEAIAAKMR 
PWVPSLFSCT LPLILLLSLW 
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Figure 3 - SEQ ED NO:3 



1 


ATGGTGCGCC 


51 


GCTGCTGCTG 


101 


TTCCCACAGA 


151 


TGCCAGGCTG 


201 


CACCTCTAGC 


251 


CTGACTGCCT 


301 


TGCATGTGCC 


351 


TTGGACCGTT 


401 


CCCCCTATGA 


451 


AAACTGAACA 


501 


GCTGTGTACT 


551 


AGGCGTGCTC 


601 


CTCACTTTCT 


651 


GTGCCCATGT 


701 


CCATCGCCCC 


751 


CTGCGGCGCC 


801 


TTTCCAGACC 


851 


AGCAGTCCAG 


901 


ACCCCCAACT 


951 


CTGCCGAGGC 


1001 


TCTTCTCCCA 


1051 


TTTCACAGCC 


1101 


GATGGCACAC 


1151 


CCTCTCTTTT 



Figure 4 - SEQ ID NO:4 



1 


MVRPLNPRPL 


51 


CQADPTCSDA 


101 


CMCHRRMKNQ 


151 


KLNMLKPDSD 


201 


LTFFEKAAEP 


251 


LRRLCFSDPL 


301 


TPNFVSNVNT 


351 


FHSQLFSQDW 



CCCTGAACCC GCGACCGCTG 

CTGCCGCCGT CGCCGCTGCC 

AAGCCGACTC ATGAACAGCT 

ATCCCACCTG CAGTGATGCC 

ATAAGCACCC CACTGCCCTC 

GGAGGCAGCA CAGCAACTCA 

ACCGGCGCAT GAAGAACCAG 

CACCGTGCCC GCAGCCTTGG 

AGACACAGTG ACCAGCAAAC 

TGCTCAAACC AGACTCAGAC 

CTCAATGACA AGTGTGAC CG 

CGGGCCCCAC TGCCAGCGCC 

TCGAGAAGGC CGCCGAGCCC 

GCCCCCAACG ACCGGGGCTG 

CAACTGCGCG CTGCCGCCTG 

TCTGCTTCTC CGACCCGCTT 

CACTGCCATC CCATGGACAT 

ATGTCTACGA GCATACCTGG 

TTGTCAGCAA TGTCAACACC 

AGTGGCAACC TGCAGGAGGA 

CAACCCCTGC CTCACGGAGG 

AACTCTTCTC CCAGGACTGG 

CAGAATGAAA ACCCTGCTGT 
CTCCTGCACG CTTCCCTTGA 



PPWIiMLLLL LPPSPLPLAA 

YHHLDSCTSS ISTPLPSEEP 

VACLDIYWTV HRARS LGNYE 

LCLKFAMLCT LNDKCDRLRK 

HAQGLLLCPC APNDRGCGER 

CRSRLVDFQT HCHPMDILGT 

SVALSCTCRG SGNLQEECEM 
PHPTFAVMAH QNENPAVRPQ 



CCGCCCGTAG TCCTGATGTT 

TCTCGCAGCC GGAGACCCCC 

GTCTCCAGGC CAGGAGGAAG 

TACCACCACC TGGATTCCTG 

AGAGGAGCCT TCGGTCCCTG 

GGAACAGCTC TCTGATAGGC 

GTTGCCTGCT TGGAcATCTA 

TAACTATGAG CTGGATGTCT 

CCTGGAAAAT GAATCTCAGC 

CTCTGCCTCA AGTTTGCCAT 

GCTGCGCAAG GCCTACGGGG 

ACGTCTGCCT CAGGCAGCTG 

CACGCGCAGG GCCTGCTACT 

CGGGGAGCGC CGGCGCAACA 

TGGCCCCCAA CTGCCTGGAG 

TGCAGATCAC GCCTGGTGGA 

CCTAGGAACT TGTGCAACAG 

GGCTGATTGG GACTGCCATG 

AGTGTTGCCT TAAGCTGCAC 

GTGTGAAATG CTGGAAGGGT 

CCATTGCAGC TAAGATGCGT 

CCACACCCTA CCTTTGCTGT 

GAGGCCACAG CCCTGGGTGC 
TTCTGCTCCT GAGCCTATGG 



GDPLPTESRL MNSCLQARRK 

SVPADCLEAA QQLRNSSLIG 

LDVSPYEDTV TSKPWKMNLS 

AYGEACSGPH CQRHVCLRQL 

RRNTIAPNCA LPPVAPNCLE 

CATEQSRCLR AYLGLIGTAM 

LEGFFSHNPC LTEAIAAKMR 
PWVPSLFSCT LPLILLLSLW 
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Figure 5 - SEQ ID NO:5 (Partial coding cDNA sequence for human GDNF a3 



receptor) 










1 


GAGCGCCGGC 


GCAACACCAT 


CGCCCCCAAC 


TGCGCGCTGC 


CGCCTGTGGC 


51 


CCCCAACTGC 


CTGGAGCTGC 


GGCGCCTCTG 


CTTCTCCGAC 


CCGCTTTGCA 


101 


GATCACGCCT 


GGTGGATTTC 


CAGACCCACT 


GCCATCCCAT 


GGACATCCTA 


151 


GGAACTTGTG 


CAACAGAGCA 


GTCCAGATGT 


CTACGAGCAT 


ACCTGGGGCT 


201 


GATTGGGACT 


GCCATGACCC 


CCAACTTTGT 


CAGCAATGTC 


AACACCAGTG 


251 




CTGCACCTGC 


CGAGGCAGTG 


GCAACCTGCA 


GGAGGAGTGT 


301 


GAAATGCTGG 


AAGGGTTCTT 


CTCCCACAAC 


CCCTGCCTCA 


CGGAGGCCAT 


351 


TGCAGCTAAG 


ATGCGTTTTC 


ACAGCCAACT 


CTTCTCCCAG 


GACTGGCCAC 


401 


ACCCTACCTT 


TGCTGTGATG 


GCACACCAGA 


AT G AAAAC C C 


TGCTGTGAGG 


451 


CCACAGCCCT 


GGGTGCCCTC 


TCTTTTCTCC 


TGCACGCTTC 


CCTTGATTCT 


501 


GCTCCTGAGC 


CTATGGTAG 









Figure 6 - SEQ ID NO:6 (amino acid sequence for the partial GDNF cc3 receptor 
sequence) 



1 ERRRNTIAPN CALPPVAPNC LELRRLCFSD PLCRSRLVDF QTHCHPMDIL 

51 GTCATEQSRC LRAYLGLIGT AMT PNFVSNV NTSVALSCTC RGSGNLQEEC 

101 EMLEGFFSHN PCLTEAIAAK MRFHSQLFSQ DWPHPTFAVM AHQNENPAVR 

35 151 PQP WVPSLFS CTLPLILLLS LW 

The sequence underlined corresponds to the predicted hydrophobic C-terminus 
characteristic of GPI anchored cell-surface receptors 
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